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Abstract 

I We describe a measurement device principle based on discrete iter- 

ations of Bayesian updating of system state probability distributions. 
Although purely classical by nature, these measurements are accompa- 
nied with a progressive collapse of the system state probability distri- 
^>> bution during each complete system measurement. This measurement 

scheme finds applications in analysing repeated non-demolition indi- 
rect quantum measurements. We also analyse the continuous time 
limit of these processes, either in the Brownian diffusive limit or in 



fH quantum systems under continuous measurements. 
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1 Introduction 



Informal and formal similarities between Bayesian inference pQ and quan- 
tum mechanics have been noted quite some time ago, see e.g. [2J. Bayesian 
inference may be seen as a way to update trial probability distributions by 
taking into account the partial information one has gained on the system 
under study. Indirect quantum measurement consists in obtaining partial 
information on a quantum system by letting it interact with another quan- 
tum system, called a probe, and performing a direct Von Neumann mea- 
surement on this probe. Iterating the process of system-probe interaction 
and probe measurement increases the information on the system because of 
system-probe entanglements. 

This has been experimentally implemented in electrodynamics in cavities 
[3j, but also in superconductor circuits [4j. As shown by these experiments, 
repeating a large number of times (formally, infinitely many times) indirect 
non-demolition measurements [5] reproduces macroscopic direct measure- 
ments with collapse of the system quantum wave function. Each collapse is 
stochastic and progressive, becoming sharper and sharper as the number of 
indirect measurements increases. 

Controlling quantum systems [7] by repeating measurements is, in some 
way, as old as quantum mechanics, but it has recently been further de- 
veloped aiming at quantum state manipulations and quantum information 
processing [8J. At a theoretical level, the concept of quantum trajectories 
[3 EI] emerges from the need to describe quantum jumps and randomness 
inherent to repeated measurements. In parallel, studies of open quantum 
systems [TT] led to the theory of quantum feedback [T2] and quantum con- 
tinual measurements [13]. Belavkin equations [2] are stochastic non- linear 
generalizations of the Schrodinger equation adapted to quantum systems 
under continual measurements. 

Contact between experiments of the type described in ref.[3] and classi- 
cal stochastic processes was made in ref.[6], showing in particular that the 
approach to the collapse is controlled by a relevant relative entropy. The 
aim of this note is to follow and complement the study of ref.[6j, by, in 
some way, reversing the logic. We start by forgetting quantum mechanics 
for a while and we study a random process obtained by discretely and ran- 
domly updating a system state probability distribution using Bayes' rules. 
Iterated stochastic measurements refer to this random recursive updating. 
We describe why and how this leads to a stochastic measurement prin- 
ciple allowing to measure the initial system state probability distribution 
but which implements a random collapse of the system state distribution 
at each individual complete system measurement. The initial system state 
distribution is nevertheless reconstructed by repeating the complete system 
measurements. We point out a connection between De Fenetti's theorem 
on exchangeable random variables, see e.g. ref.pj)], and iterated stochastic 
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measurements. We also show that these discrete measurement devices admit 
continuous formulations with continual updating. There are two limits: a 
Brownian diffusive limit in which the random data used to update the system 
state distribution are coded into Brownian motions, this case was studied 
in ref. [16J, and a Poissonian jumpy limit in which these random data are 
coded in point processes. The construction of the continuous time process 
relies on deforming an a priori probability measure on the updating data. 
The key tool is Girsanov's theorem. Then we transport these results, in an 
almost automatic way, to quantum mechanics, and we show that quantum 
mechanical systems under repeated non-demolition indirect measurements 
admit a continuous time limit described by Belavkin equations 
completes results proved in ref. [T7] and makes contact with those described 
inref.[IH]. 

2 Iterated indirect stochastic measurements 

Let S be the system under study and A be a chosen countable set of system 
states a G A that we shall call pointer states The model apparatus is 
going to measure the probability distribution Qo(a), with ^2 a Qo(a) = 1, 
for the system S to be in one of the pointer state. 

The model apparatus is made of an infinite series of indirect partial 
measurements. Let / denote the set of possible results of one partial mea- 
surements, which we assume to be finite or countable. For each system com- 
plete measurement, the output datum is thus an infinite sequence of data 
(*ij*2) • ••)) £ I, associated to the series of successive partial measure- 
ments. The output data are random. The probability distribution Qo(a) is 
to be reconstructed from the sequences (ii, ■ ■ ■ ). 

To be concrete one may keep in mind that the indirect partial measure- 
ments arise from direct measurements on probes which have been coupled 
to the system. The model apparatus is then made of an infinite set of 
in-going probes - which, for simplicity, are supposed to be all identical - 
passing through the system S and interacting with it one after the other. 
Measurements are done on the out-going probes. 

Specifications of the model apparatus depend on the chosen set of pointer 
states. One of its manufacturing characteristics is a collection of probability 
distributions p(i\a), YliiP^W) = 1, for the output partial measurement to be 
i G I conditioned on the system S be in the state a £ A. For simplicity, we 
shall assume a non-degeneracy hypothesis which amounts to suppose that 
all probability distributions p(-\a) are distinct, i.e. for any pair of distinct 
pointer states a and (3 there exists i E I such that p(i\a) ^ p(i\f3). 

5 According to the quantum terminology, but the concept of states is here more general 
as it simply refers to a complete list of labels characterizing the system behavior. 
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2.1 Discrete time description 

In the model apparatus, a complete measurement is made of an infinite series 
of partial measurements such that each output of these partial measurements 
provides a gain of information on the system. Our first aim is to decipher 
which informations one is gaining from the n th first partial measurements. 
This will allow us to spell out the way the model apparatus is working as a 
measurement device. 

• Series of partial measurements and specification of the model apparatus. 
Suppose that the first partial measurement gives result i\ G I. Bayes' law 
then tells us that the probability for the system S to be in the state a con- 
ditioned on the first measurement be i\ is Q\{a\i\) = Qo(a)p(ii\a)/iro(ii), 
with 7To(i) := J2a Qo(oOf>(i|a:), if Qo(ct) is the initial probability for the sys- 
tem S to be in the state a (this probability is yet unknown but shall be 
recovered from the series of partial measurements making a complete mea- 
surement). Let us now ask ourselves what is the probability to get i 2 as 
second output partial measurement? By the law of conditioned probabili- 
ties, 7Ti(z2Ki) = X^ a P(*i^2|a) Qo(«)/^o(*i) with p{ii^i 2 \a) the probability 
to measure i\ and 12 on the two first partial measurements conditioned on 
the system to be in the state a. At this point we need to make an assump- 
tion: we assume that the output partial measurements are independent and 
identically distributed (i.i.d.) provided that the system S is in one of the 
pointer state a £ A. This translates into the relation 

p(ii,i 2 \a) = p(i2\a)p(h\a), 

which implies that irifalh) = Yl a P(^ 2 \ a ) Qi( a \h)- That is the probability 
tti(*2|*i) is identical to the probability to get i 2 as output partial measure- 
ment assuming that the system distribution is Qi(ct\ii). 

Hence, as a defining characteristic property of our model apparatus, we 
assume that the output of the n th partial measurements is independent of 
those of the (n — l)-first outputs provided the system S is in one of the 
pointer state a G A, that is: 

n 

p(h,--- ,i n -i,in\a) = p(i n \a)p(ii,- ■ ■ ,i n -i\a) = JJp(ifc|a), Va. (1) 

k=l 

This specifies our model apparatus. This specification is clearly attached to 
the chosen set of pointer states. 

Conversely, the pointer states associated to this device are those system 
states for which the values of the output partial measurements are inde- 
pendent, i.e. conditioned on the system to be in a pointer state, the output 
variables i\, i 2 , ■ ■ ■ are independent and identically distributed. If the system 
is initially in a pointer state a, that is its probability distribution is peaked, 
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Qo(') = 5-,a, the occurrence frequency u(i) of the value i in the output se- 
quence (ii, *2 9 • • • ) is p(i|a). As we shall see later, one may then identify 
the pointer states as the system states for which independent infinite series 
of partial measurements (i.e. independent complete measurements) provide 
identical occurrence frequencies v(-), and this gives a way to calibrate the 
device and to determine the conditioned probabilities p(-\a). 

If the system is not in a pointer state, its initial distribution Qo(a) - 
to be determined - is un-peaked. Let Q n (oi\ii, ■ ■ ■ ,i n ) be the probability 
for the system to be in the state a conditioned on the n-first output partial 
measurements be ii,i2,--- i*n- From our hypothesis (Q, the probability 
to get i as the n th output conditioned on the (n — 1) first outputs be 
(k, ■ ■ ■ ,«'ti-i) is 

n n -i{i\h, ■ ■ ■ >%-i) = ^2p(i\a)Qn-i(ai\ii,--- ,i n -i)- (2) 

a 

By Bayes' law, the probability for the system to be in the state a conditioned 
on the n-first measurements be ii,%2, • • • , in is then recursively computed by 

n { \- • \ p(in\a) Q n -i(a\k, ■ ■ ■ ,i n -i) ro , 

Qn(a\%i, ■■■ ,i n ) = y^y. ; v , (3) 

TTn-l^nPlj • • • ,1>n-l) 

where 7r n _i is the probability to get i n as the n th output. To simplify 
notations we denote Q n (a\ii, ■ ■ • ,i n ) by Q n (a) and 7r n _i(i|ii, • • • , i n -i) by 
TTn-l(i)- Eq.Q can be solved explicitely: 

with N n (i) the number of times the value i appears in the n th first outputs. 

Let us point out an interesting reformulation of the above conditions 
on the outputs of the model apparatus. A sequence (11,12, ■••) of random 
variables is called exchangeable if the distribution of (11,12,' •■ , in) is the 
same as the distribution of (i ai , i a2 , ■ ■ ■ , i an ) for each n and each permutation 
a of [1, 2, • • • , n]. A remarkable theorem due to De Finetti (see e.g. ref.|15j 
or the last two items of ref.[19]) asserts that an infinite sequence (ii, 12, ■ ■ ■ ) 
of random variables is exchangeable if and only if there is a random variable 
A such that, conditionally on A, «2, • • • ) is a sequence of independent 
identically distributed random variables. In our construction, the values 
taken by A are nothing but the pointer states and the measure on A is Qq. 
So the hypotheses on the model apparatus can be rephrased as the fact that 
the order of partial measurements is immaterial. 

More concretely, let O be the data set of all complete measurements. 
This is made of all infinite series u := (11,12, ■■ ■ ), ik £ I, of output par- 
tial measurements. We may endow £1 with the filtration T n of cr-algebras 
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generated by the sets ,i„ := {w = ■ ■ ■ , i n , anything else) £ O}, i.e. 

codes for the knowledge of the n th first partial measurements. This fil- 
tered space is equipped with a probability measure recursively defined by 
F[i n = i|J>i_i] = 7r n _i(z). Notice that, given Qo(a), this probability mea- 
sure decomposes as a sum 

P = ^Qo(a)P Q 

a 

where P Q will be the probability measure induced on 0, if the system hap- 
pened to be initially in the pointer state a, i.e. if Qo(-) is peaked at a. 
Under ¥ a the partial outputs are independent random variables so that 

n 

fa[B h ,.., in ) = l[p(ik\a). 

k=l 

Let us then quote properties of the random probability distribution 
Q n (-), which will be keys for specifying the model measurement device: 

(i) Peaked distributions are stable under the recursion relation ^ . That 
is, if Qo(-) = 5. - a then Q n {-) = 5.- a for any n. 

(ii) Given Qo(-) generic, the random variables ^n(c^) converge as ti goes 
to infinity almost surely and in L . The limiting distribution Qoo(") is peaked 
at a random target pointer state. That is: 

Qoo (') = ^-;7w 

with target pointer state 7^ depending on the event ui. The probability for 
the target to be a given pointer state a is the initial probability distribution: 

F[lu J = a} = Qo{a). 

(iii) The asymptotic occurrence frequencies u(i) := \vm n N n (i)/n, with 
N n {i) the number of times the value i appears in the n th first outputs, are 
those of the target pointer state. That is: 

lim N n {i)/n = p(i|7o,). 

n— >oo 

(iv) The convergence is exponentially fast: 

Q n {a) ~ exp(-nS(ju\a)), a ^ 7^, 

for n large enough, with 5(7^10) the relative entropy of p(-|7 w ) relative to 
p(-\a). 

These facts have been proved in ref.[6]. They are based on the fact 
that the random variables Q n (a) are bounded P-martingales with respect 
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Figure 1: A schematic view of iterated stochastic measurements: probes are 
send one after the other to interact with the system for a while. After the 
interaction, a measurement is performed on each probe. The information 
gained is summarized in the occurrence frequencies, which allow to identify 
the limiting state. 

to the filtration F n . That is E[<3 n (a;)|.F n _i] = Qn-ii®)- A classical the- 
orem of probability theory [19] says that a bounded martingale converges 
almost surely and in L 1 , so that Qoo(ct) '■= lim n Q n (a) exists and Q n (a) = 
^[Qoo(ot)\J- n ]. More general results, involving for instance extra randomness 
on the partial measurements or relaxing the non-degeneracy hypothesis on 
the conditioned probability p(-\a), have been obtained in ref.|16j. 

• How to read-off a complete measurement and consequences. Let us 
summarize how the model apparatus is (concretely) working and how data 
are analysed, see FigJTJ For a given system measurement, the data is an 
infinite sequence to = (ii,t2, • • • ) of output partial measurements. From its 
asymptotic behaviour, the apparatus computes the asymptotic frequencies 
v(i) of occurrences of the values i in the sequence w, and it compares it to 
one of the apparatus data-base distributions p{i\a). By the non-degeneracy 
hypothesis and the above convergence theorem [6], each of the asymptotic 
frequencies coincide with one of the data-base distributions, so that the 
comparison identifies uniquely the target pointer state and that identified 
state is by definition the result of a complete system measurement. Since by 
the above theorem the distribution of the target pointer states is the initial 
distribution Qo(-), the histogram of repeated independent complete system 
measurements yields the initial distribution. 

Notice that by the end of a complete measurement the system state 
distribution has collapsed into one of the pointer states. The need for an 
infinite series of partial measurement reflects the need for a macroscopic 
apparatus to implement the collapse. If the system measurement is stopped 
after a finite number of partial measurements the collapse is only partial, 
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i.e. the probability distribution Q n (-) is still smeared around the target 
pointer state. The target pointer state may nevertheless be identified with 
high fidelity if the differences between the data-base probability distribu- 
tions p(-\a) are bigger than the fluctuations of the frequencies v n (-) which 
generically scale like n -1 / 2 . 

2.2 Continuous time limit 

We now describe continuous time limits of the previous model apparatus in 
which the partial measurements are done continuously in time. There are 
different continuous time limits, depending on the behaviour of the data- 
base conditioned probability distributions p(-\a): a Brownian diffusive limit, 
a Poissonian jumpy limit, or a mixture of them. 

These limits may be understood by looking at properties of the counting 
process N n (i) := Ysk=i ^k=« which is the number of times the value i appears 
in the n th first partial measurements. Recall that 7r m _i(i) = E[Ii m= j|.7 : ' m _i] 
is the probability to get i as the m th partial output conditioned on the 
(m — l) th first partial outputs. We may tautologically decompose N n {i) as 

n-1 

N n (i) = X n (i) + A n (i), with A n (i) := ^ vr m (i), (5) 

where this equation serves as the definition of X n (i), i.e. X n {i) := N n (i) — 
A n (i), with ^2 i X n (i) = as both N n (i) and A n {i) add up to n. Then by 
construction JL[X n (i)\F n -\\ = X n -i(i), so that the processes X n (i) are P- 
martingales with respect to the filtration T n . Equation ^ is the so-called 
Doob decomposition of N n (i) as the processes A n (i) are predictable, i.e. 
A n (i) is J^-i-mesurable, see ref.[19]. The martingale property in particular 
implies that E[X n (i)] = 0. 

Recall now the recursion relation ^ that we may rewrite as 

Qn{oi) - Qn-i(a) = Q„_i(a) V] P( l \ a ) (l in=i - 7r n _i(i)) 

i 

which holds true because 'Yl li p{i\a) = 1. By construction — 7r n _i(i) = 
X n {i) — X n -i(i), so that 

(AQ) n (a) = g„_x(a) ^ (AX) n (i). (6) 

with (AQ) n (a) := Q n (a) - Q n -i(a) and (AX) n (i) := X n (i) - X n ^(i). 
We thus have rewritten the recursion relation ^ as a discrete non-linear 
difference equation for the probability distribution Q n { ) driven by discrete 
differences of the martingales X n (i). This will be the starting point of the 
continuous time limits. 
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Before going on let us point out a geometrical interpretation of ^n(c^) 
which will be useful later. On the set of complete measurements, we have 
defined a global measure P and a series of measures P a associated to each 
of the pointer states with P = J2 a Qo(o:)Pa- ^ is a simple matter to check 
that P Q is non singular with respect to P, so that there exists a Radon- 
Nikodym derivative of P a with respect to P, see ref.|19|. This derivative is 
Qoo(oi)/Qo(a). More concretely, for any J^-measurable integrable function 
X, 

Q (a)E a [X]=E[Q n (a)X], 

with Q n {a) = E[<5 00 (a)|J r n ], as may be checked directly. 

We may tautologically refine this geometrical construction. Let us start 
from an arbitrary probability measure P° on S7, and let us set 3» 1 ... := 
P°[i3j lj ... t i n ], assuming that none of these probabilities vanish. Let Z n and 
Z n {a) be J^-measurable functions defined by 

Z n (a)(ii, • • • ,i n ) := Xu-,^ H?>(»fc| a )j Zn := /^2Qo( a ) Z n{a), 

k a 

so that Q n (a)/Qo(a) = Z n (a)/Z n . It is easy to check that each Z n (a) 
is a P°-martingale, E°[Z n (a)| F n —l] = Z n -\{ct). As it is clear from their 
definition, the Z n (a)'s are the Radon-Nikodym derivative of the measures 
P a 's with respect to P° on J^-measurable functions, that is 

E a [X]=E°[Z n ( a )X], 

for any J^-measurable function X. Of course, Z n is the Radon-Nikodym 
derivatives of P with respect to P°, i.e. ELY] = E°[Z n X] for any im- 
measurable function X. Choosing adequately P° helps taking the continuous 
time limit, a fact that we shall illustrate below. 

2.2.1 Brownian diffusive limit 

The Brownian diffusive limit occurs when the conditioned probability p{-\a) 
depends on an extra small parameter 5 such that 

p(i\a) ^5^0 Po(i) (l + vS T(i\a) H ) , 

with all poW s non vanishing and a-independent. Since ^2iPo(i) = 1, 
the Po(0' s define an a-independent probability measure on /. Note that 
^2iPo(i)F(i\a) = for all a since ^2iP(i\a) = 1 for all 5. By the non- 
degeneracy assumption, the functions T(-|q) on I are all different. 

The continuous time limit is then obtained by performing the scaling 
limit S — > 0, n — > oo with t := n/5 fixed. 

To understand the scaling limit of the counting processes N n (i), let us 
look at its behaviour under F a , i.e. for a system in the pointer state a with 
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initial distribution Qo(") = Then by hypothesis the output partial mea- 
surements are random independent variables, so that N n (i) = Ysk=i ^fc=« * s 
the sum of n independent identically distributed (i.i.d.) variables with 
value 1 (if the output of k th partial measurement is i) with probability 
p(i\a) and zero (if the output of k th partial measurement is different from i) 
with complementary probability. By the law of large numbers, the N n (i)'s 
at large n become Gaussian processes with mean np(i\a) and covariance 
min(n, m) (p(i\a)5i-j — p(i\a)p(j\a)). Under these hypotheses, the probabil- 
ity 7r m-i(^) for the m th output partial measurement to be i is p(i\a) for all 
m, so that A n (i) = np(i\a). Hence, under P a and for such peaked initial 
distribution, the law of the processes X n (i) at large n is that of Gaussian 
processes with zero mean and covariance 

E a [X n (i)X m (j)] = min(n,m) (p(i\a)5i-j - p(i\a)p(j\a)), for Q (-) = d.- a . 

After appropriate rescaling, this clearly admits a finite limit as 5 — > which 
is a-independent. Hence under this hypothesis, X n (i) admits a continuous 
time limit Xt(i), formally to be thought of as lim^o V6X^ t /^(i). However, 
the previous equation is not enough to describe this limit under the law P 
and some care has to be taken, see ref.|16j. 

So, let us define the scaling diffusive limit of the state distribution and 
the Doob martingales: 

Qt(6) := lim Q [t m(a), X t (i) := lim VSX [t/s] (i), 

and of the counting processes, 

W t (i) := lim VS(N [t/5] (i)-p (i)t/5). 

These equalities have to be thought in law, but we shall still denote by 
P = J2 a ( 3o(«)IP > o the probability measure for the continuous time processes. 
By construction, Xt(i) are P- martingales. 

The discrete difference equation ^ naively translates into the non-linear 
stochastic equation for Qt(a). Recall that in the diffusive limit, p{i\a) ~ 
Po(i)[l + V6T(i\a) + •••] as 5 goes to zero, so that 7r n _i(i) ~ po(i)[l + 
yj~8 (T(i))t + • • • ] with (r(i))t := J2 a T(i\a) Qt(a). In the continuous time 
limit, eq.Q then becomes: 

dQt(a) = Qt(a) ^ (r(»|a) - (F(i)) t )dX t (i) (7) 

i 

with Ito's convention. We used J2i Xt(i) = and X^iPo(*)r(^|«) = to take 
this limit. Remark that this equation preserves the normalisation condition 
^2 a Qt(oi) = 1. This equation is that which governs the evolution of the 
system probability distribution under continuous Bayes' updating in the 
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diffusive limit. The random fields Xt{i) code for the information of the 
continuous time series of partial measurements. Not all of these fields are 
independent since = 0- As we shall see, the main feature of the 

Brownian diffusive limit is that the Xt(z)'s are Gaussian processes with zero 
mean and covariance 

E[X t (i)X s (j)} = min(t, s) (po(i)*irf " Po(*)Po(?)). ( 8 ) 

Alternatively, the fields Xt(i) are zero mean Gaussian martingales with 
quadratic variation 

dX t (i)dX t (j) = (po(i)Si;j -po{i)po(j))dt, 

which is of course compatible with the relation £\ Xt(i) = 0. Actually the 
proofs of equation ^ and of the correctness of Q are a bit tricky, see [16] 
for details. We shall here present an alternative less rigorous but quicker 
and simpler argument. 

Let us now argue for eq.Q. Recall the Doob decomposition of the 
counting process N n (i) = X n (i) + A n (i). Its naive scaling limit reads 

W t (i) = X t (i)+ I dsp (i)(T{i)) s , 
Jo 

whose infinitesimal differential form is 

dWtii) = dX t (i) + p (i) (T(i)) t dt (9) 

Contrary to the Xt(£)'s 3 the Wt(i)'s are not P-martingales but they are 
globally defined and independent of the initial distribution Qo(-) because 
they are defined as limit of the counting process. We however know that, 
under P a , the Wt(z)'s are Gaussian processes with mean and covariance 

E a [W t (i)] = t Po (i)T(i\a), 
Cov a [W t (i)W s (j)] = mm(t,s)(p (i)5rj - p (i)poU)). 

We now would like to use this information to read off properties of the 
martingales JQ(i)'s. The key point consists in using Girsanov's theorem |19j . 
Recall that (J 00(a) may be thought as the Radon-Nikodym derivative of IP a 
with respect to P and that Qt{pi) = E[Q 00 (a)| J-t]. Assume for a while that 
the .Xt(z)'s are Gaussian processes under P with zero mean and quadratic 
variation G(i, j)dt := (dXt(i), dXt(j)} to be determined. Girsanov's theorem 
tells us that modifying the measure P by multiplication by the martingale 
Qt{ct) adds a supplementary drift in the stochastic differential equation Q, 
given by the logarithmic derivative of the martingales. In the present case, 
using eq.([7]) Girsanov's theorem implies that 

dW t {i) = dX t (i)+ Po (i) (T(i))tdt + J2G(i,j){r(j\a) - (T(j)) t )dt, 

j 
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with Xt(iys Gaussian processes under ¥ a with zero mean and identical 
quadratic variation G(i,j)dt. Comparing now with the known properties 
of Wt(i) under P a , spelled out above, we deduce that G(i,j) = (po(i)5i ] j — 
Po(i)po(j)), as claimed [^J so that the previous equation reduces to 

dW t {i) = dX t (i) + p (i)T(i\a)dt, 



under P a , as required. This ends our argument for equations (7|8). 



A way to rigorously construct processes with all the above properties is 
to deform a suitable a priori measure P°. Details have been given in ref.|16|. 
In this note, we shall illustrate this strategy in the Poissonian case. 

Eq.Q may actually be integrated explicitly, see ref.|16j. Furthermore, 
as bounded martingales the Qt(a)'s again converge almost surely and in L 1 . 
Under the non-degeneracy assumption that all T(-|a) are different, the limit 
distribution is peaked, Qoo( - ) = <5-;7^> a ^ a random target pointer state. The 
convergence is still exponential. 



2.2.2 Poissonian jumpy limit 

The Poissonian limit occurs when the conditioned probabilities p{i\a) vanish 
as a small parameter 5 vanishes. Not all p(i\a)'s may vanish simultaneously 
as they sum up to 1. So let us single out one value i* for which p{i\a) goes 
to 1 as 6 — > for all a and assume that all other p{i\a) vanish in this limit: 

p(i*\a) ~5_>o 1, p(i\oi) ~5 89(i\a) for i ^ i* , Va. 

By consistency p(i* \ a) = 1 — 5(^2^* 0(i\a)) + o(5) and all 6(i\a) are positive 
and assumed to be non- vanishing. In the limit 5 — > 0, the output of the 
partial measurements is most frequently i* with sporadic jumps to another 
value i different from i* 

The continuous time limit is obtained by performing the scaling limit 
6 — > 0, n — > oo with t = n/5 fixed. 

To understand the continuous time limit of the counting processes N n (i), 
let us again look at its behaviour under F a . As before, the output par- 
tial measurements are then random independent variables, so that N n (i) = 
Sfc=i ^ik=i 1S sum of n independent identically distributed variables €k{i) 
with value 1 with probability p{i\a) and zero with complementary proba- 
bility. Let us first consider i ^ i* and compute logE Q ,[e 2Ar ' l( -^] = nlog[(l — 
p(i\a) + e z p(i\a)]. In the scaling limit with n = t/5 and p(i\a) ~ 5 9{i\a) we 
get 

limlogE a [e zJV [*/*]«] = W(i\a) (e z - 1), i± i* . 

8— s>0 

6 This expression for the quadratic variation of the X t (i)'s is compatible with the rela- 
tion Xt(i) = 0, since G(i,j) = as it should. 

7 We may generalize this by assuming that more than one conditioned probabilities 
remain finite as S goes to zero. In these cases, the continuous time limit will be a mixture 
between the Brownian and Poissonian limits. 
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Similar computations, based on the general formula 

k / \ ni—ni-i 



Ef =1 Ei ^(OC^W-JVn,,^*)) 



] =n e^pM") 



i=i 

for ft > 1, arbitrary non-decreasing sequences of integers = no < n\ < 
■■■ < rik of length ft, and arbitrary (complex) z/(i)'s, show that, in the 
limit S — > 0, the P a -distributions of the counting processes JVr t /5](i), z 7^ i*, 
converge to those of independent Poisson point processes with intensities 
0(i\a) dt. Note that this statement is true under P a but not under P. How- 
ever, we can compute their IP-generating functions using the decomposition 
of the measure P = ^ Q Qo(a)P a . For instance 

E[ e *^/flW] ~ s _^ ^Q (a)e^l Q )( eZ - 1 ), i + i*. 

a 

The properties of N n (i*) and their limits are reconstructed using the 
sum rule, X^^nCO = n - I 11 particular, for small 5, Nu/g\(i*) ~ t/5 up to 
order 1 random corrections. 

So, let us define the scaling Poisson limits of the state distribution and 
of the Doob martingales X n (i)'s, 

Qt(a) := lim Q Y t (i) := lim Xu /s] (i) 

and of the jump counting processes 

N t {i) := lim Nu/ S ](i), for i ^ i* , 

and M t (i*) := lim^o (-W[t/ai(**) ~~ V^)- Again, these equalities have to 
be thought in law, but we still denote by P = ^2 a Qo(a)P a the probability 
measure for the time continuous processes. By construction, the martingales 
Y t {i) sum up to zero, £^it(i) = 0, and have zero mean, E[Y" t (i)] = 0. 
Similarly, M t (i*) + = 0. 

Again, the naive scaling limit of the difference equation ^ yields a 
stochastic equation for the system state distribution. In the Poissonian 
limit, one has p{i\a) ~ 56{i\a) + • • • for i ^ i* as 5 — > 0, so that ir n -i(i) — 
5(8(i))t + --- with (9(i))t '■= Y2 a ^(A a )Qt( a )) f° r * 7^ **) whereas both 
and 7r n _i(i*) approach 1 as S goes to zero. The continuous time 
limit of eq.Q is then 

dQ t (a) = Q t (a) £ _ 1) d y t(i) . (10 ) 

where we used <ilf(i*) = — Yltyi* ^^t(0 to deal with the term associated 
to i* in eq.Q. As we shall show just below, the Y((z)'s are related to the 
counting processes by 

dN t (i) = dY t {i) + {6{i)) t dt, i + (11) 
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We shall furthermore argue that the processes dNt(i), i ^ i* , are point 
processes with intensities (9(i))tdt. This intensity is sample dependent - a 
point that we shall explain -, but predictable. Equations ( 10|11 ) are those 



which governs the evolution of the system probability distribution under 
continuous Bayes' updating in the Poissonian limit. The random counting 
processes iVj(i) code for the informations on the continuous time series of 
partial measurements. 



Let us now argue for eq. ( 1 1 ) . Consider again the Doob decomposition 
N n (i) = X n (i) + A n (i). Because (*) — $ {Q(i))t for small 5, its naive 
scaling reads 

N t (i) = Y t {i) + / ds(e(i)) s , i^i*. 



o 



Its infinitesimal version is eq.( 11 ), as announced. Since p(i*\a) ~ 1— 6a(i*\a) 
with a{i*\a) := J2i^i* 9{A a )-> the counting function N n {i*) slightly deviates 
from n, and M t (i*) = Y t (?)-f* ds (a(i*)) s with {a(i*)) a := £ Q <r(i*\<x)Q a (at) 
By the martingale property, E[dYj(i)| Ft] = so that 

E[dN t (i)\T t ] = (6(i)) t dt, i + 

That is the number of jumps in the direction i in the time interval [t, t + dt) 
depends on the past of the process and is equal to (9{i))tdt in mean. We 
may go a little further and compute the generating function of those jumps. 
Indeed, since the conditional measure E[-|.Ft] decomposes as a sum, E[-|.Ft] = 
^2 a Qt(a)K a [-\J 7 t ], and since under E a [-|^i] the dNt(i)'s are Poisson point 
processes with intensity 6(i\a)dt, we have 

logE[e zdN ^\T t ] = dt (9(i)) t (e z - 1), (12) 

with (9(i))t = Y^,a Qt(oi)6(i\a). That is, under P, the dNt(i)'s are point 
processes with intensities (9(i))tdt, as announced. As above, a similar com- 
putation shows that the diVt(i)'s, for fixed t, are independent variables for 
i ^ j under P Q but not under P. An alternative description of this limit is 
given in ref.[17j. see also the forthcoming ref.[21j. 

Up to now, our arguments have been only in law. A rigourous construc- 
tion^] of processes, living on a well-defined probability space, and having all 
the required properties, is to deform a suitable a priori measure P°. The 
hint that this is possible is the formula for Qt{a) obtained by taking the 
continuous time limit of eq. Q . There are some cancellations of powers of 5 
between numerator and denominator yielding 

Qtia) = Q (a) with Z t :=^Q (P)Z t (P), 

1 13 



3 Which is an alternative to ref.[T7] in that case. 



14 



where 



Zt (a) :=l[e(i\a) N ^e- t ^- 1 \ 



(13) 



One recognizes Zt(a) as the standard exponential Poisson martingale. So, let 
us start from an a priori probability measure P° accommodating for indepen- 
dent Poisson processes Nf(i), i ^ i*, of intensity dt. Define F a := Zt(a)¥° 
on J-f Then, under ¥ a , the A^(i)'s are independent Poisson processes with 
intensity 6{i\a)dt. Defining 



it is plain that the Qt(a)'s are P-martingales and the Nt{i) J s have the law we 
were after. For instance, since Qt(a) = Qo(a) Z t (a)/Z t , we have IE [- 1 -7-^] = 
J2 a Qt( a ) E a[-\^t] sothat(iiV t := N t+dt (i) - N t (i) is at most 1 and F[dN t (i) = 
l\T t ) = dt{9(i)) t . 

3 Iterated indirect quantum measurements 

Although purely probabilistic - involving classical probability only - the pre- 
vious description of iterated stochastic measurements finds applications in 
the quantum world, in particular in the framework of repeated indirect non- 
demolition measurements [5] . Recall that an indirect quantum measurement 
consists in letting a quantum system interact with another quantum system, 
called the probe, and implementing a direct Von Neumann measurement on 
the probe. One then gains information on the system because the probe 
and the system have been entangled. Repeating the cycle of entanglement 
and measurement progressively increases the information on the system as 
in the model apparatus we described above. 

Let S be the quantum system and % s be its Hilbert space of states. Pick 
a basis of states {\a}} in 7i s , which are going to play the role of pointer states. 
Let V be the probe and H p be its Hilbert space. We assume that the probe- 
system interaction preserves the pointer states: a system initially prepared 
in one of the pointer state remains in this state after having interacted with 
the probes. This requires a peculiar form for the unitary operator U of the 
probe-system interaction: 



with U a an unitary operators on % v . Alternatively, U\a) (8) \u) = \a)(g) U a \v) 
for any \u) G H p , a property coding for the fact that the pointer states \a) 
are preserved by this interaction. 



P:= C£Qo(a)Z t (a))F°, 



a 




(14) 



a 
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We imagine sending identical copies of the probe, denoted Vi,V2, • • • , 
one after the other through the system and measuring an observable on 
each probe after the interaction. We assume that the in-going probes have 
all been prepared in the same state £ Hp, and that the observables 
measured in the out-going channel are all identical with non-degenerate 
spectrum I. Let {\i}} £ H p , i £ I, be the basis of eigenstates of the mea- 
sured observable. We denote by the output of the measurement on the k th 
out-going probe. In analogy with previous section, we call the cycle entan- 
glement and measurement on a probe a partial measurement. The results of 
repetitions of theses cycles of partial measurements are random sequences 
(h, *2> ' ' ' ) 5 ik £ I- As before, such infinite series of partial measurements 
will be called a complete measurement. The unitary operator U codes for 
the probability of measuring a given value i on the out-going probe. Suppose 
that the in-going probe has been prepared in the state \ip) and the system 
S in the state \a). After interaction, the system- probe state is \a) g) U a \ip) 
and the probability to measure the value i of the probe observable is 

p(i\a) := \(i\U a \^)\ 2 , 

by the rule of quantum mechanics. So |(i|C/ a |^)| 2 is the probability to mea- 
sure i in the out-going channel conditioned on the system state be \a). The 
analogy with the previous section should start to become clear. 



3.1 Discrete time description 

Let p be the system density matrix. The system state probability distri- 
bution is Q(a) = (a\p\a). The aim of this section is to describe how the 
system state distribution and the density matrix evolve when the cycles of 
entanglement and measurement are repeated, and to make explicit contact 
with previous sections. 

Assume that the system is initially prepared in a density matrix state 
po, and let us look at what happens during a cycle of entanglement and 
interaction. Recall that the probe is assumed to be prepared in the density 
matrix state After interaction, the joint system-probe density matrix 

is U pq®\iI)) {ip\U^ . The observable, with spectrum /, is then measured on the 
probe. If i\ is the output value of this measurement, the joint system-probe 
state is projected into p\ ® with 



Pi := 



{h\U\iP) po 

This occurs with probability 7ro(zi) = Tr( (ii\U\^) po {if)\U^\ix) ). Using the 
assumed property of U, eq.(14), this can rewritten as 



7T (i) := Tr( (i\U\i>) po {if>\U*\i) ) = J>(i|a) Q (a). 
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How this cycle is to be repeated is clear. Let p n -\ be the system den- 
sity matrix after the n — 1 first partial measurements - this density ma- 
trix depends on the random values of these measurements, so that p n -\ = 
pn-i(h, • • • , in-i), but we simplify the notation by not writing explicitly the 
values of the measurements. We let the system interact with the n th probe 
and do a measurement on this probe. If i n is the output value of this n th 
partial measurement, the system state is projected into 

Pn = ^—(inlUl^Pn^^lU^in), (15) 

where again we simplified the notation by not writing the values of the 

partial measurements - p n should have been written as p n (i 

and similarly for 7r n _i. This projection occurs with probability 7r ri _i(z n ), 

with 

7r„_i(i) := Tr( ii\U\^) pn-! ) = £p(t|a) Q„-i(a). 

a. 

The diagonal matrix elements of the density matrix are the probabilities for 



the system be in a pointer state, that is Q n {ot) = (ce\p n \ot). From eq.(15) we 
read that 

^ f x p(i n \a) Qn-i(a) 
The two above equations exactly coincide with eqs.(2j3) defining iterated 



stochastic measurements. So everything we wrote in the previous sections 
applies. In particular the collapse of the system probability distribution is 
a discrete implementation of the wave function collapse in Von Neumann 
measurement. The quantum system observable measured by the iteration 
of cycles of entanglement and indirect measurement is that with eigenstate 
basis The collapse happens only for an infinite sequence of partial 

measurement reflecting the fact that the iterated stochastic measurement 
apparatus is macroscopic only if an infinite sequence of partial measurements 
is implemented, see ref.[6]. 

3.2 Continuous time limit 

The aim of this section is to take the continuous time limit of the discrete 



recurrence equation (15) for the quantum density matrix using the results 
of the previous section. Doing this we will make contact with the so-called 
Belavkin equations [13], describing continuous time measurements in quan- 
tum mechanics and which are non-linear stochastic Schrbdinger equations 

The small parameter 5 is the time duration of the system-probe inter- 
action, so that the unitary operator is U = exp(—i5H) with H the system- 
probe hamiltonian |^} As is well known, the dynamics of a quantum system 

9 We use the notation i, without a dot, to code for the square root of — 1. 
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under continuous measurements is frozen by continuous wave packet reduc- 
tions, a fact named the quantum Zeno effect. To avoid it, we have to rescale 
the system-probe interaction at the same time we decrease the interaction 
time duration. So we assume the following form of the hamiltonian H, 

H = H s (g> 1 + 1 (g> H p + —j= Hi, (16) 

where H s is the system hamiltonian, H p the probe hamiltonian and Hi the 
interaction hamiltonian. 

For the pointer state to be stable under the action of U = e~ lSH , 
eq.(|14[), we should assume that H s is diagonal in the pointer basis, H s = 
Y2 a \ ot)E a (a\ for some energies E a , - this is linked to the non-demolition 
character of the measurement - and that 



Hi = \a)(a\®H a , 



with H a acting on % p but a dependent. The conditioned probabilities p{i\a) 
are then 

P (i\a) = \(i\i>)-iV5(i\H a \^) + ---\ 2 , 

so that the Brownian diffusive case corresponds ^ and the Poissonian 
jumpy case to = 0. 

In both cases, the continuous time limit is then obtained by performing 
the scaling limit 5 — > 0, n — > oo with t := n/8 fixed as above. 



It is useful to recast the quantum recursion relation ( 15 ) into a difference 



equation. This simplifies matter when taking the continuous time limit. Let 



us write p n = ^ p n {in) ~h n =i with p n (i n ) defined in eq.(15). Recall that 
EpL, n= j|.F n _i] = 7r n _i(i) and write I in=i = (I in=i - w n -i(i)) + 7r n _i(i). This 
leads to the Doob decomposition of the difference p n — p n -\ as, 

p n - p n -i = (Dp) n ^i + (Ap) n , (17) 

with (Dp) n -i := K[p n \ T n -i[— p n -i, which is J^-i-measurable, and (Ap) n := 
p n — K[p n \ Tn-i], which satisfies E[(Ap) n |J r n _i] = 0. Explicitely, 

% 

{Ap) n = 2^ Z TTn (InW-VlW), 

where we used Ii n =i — ir n -i(i) = X n (i) — X n -i(i), as in previous section. 
In the continuous time limit, the first term (Dp) n -\ is going to converge 
towards the drift term and the second one (A/?) n to the noisy source of the 
stochastic differential equation. 
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3.2.1 Brownian diffusive limit 



The Brownian diffusive limit occurs when (i\ip) 7^ for all i. Then p{i\a) 
Po{i)(l + V5T(i\a) + • • • ) for 5 small with, 



PQ {i) = \m\\ r(i|a) = 2Im( (i|// " 



This is the situation we encountered in the previous section on the classical 
diffusive limit, so that we can borrow all results obtained there. 

It is then a simple matter to naively take the continuous time limit of 
the difference equations (17). This limit exists only if {ifj\Hj\ip) = 0, which 
is equivalent to 

(ip\H a \ij}) = for all a, 

a criteria which we assume to hold true. Recall that this scaling limit consists 
in 5 — > with t = n5 fixed. Let us first expand the term (Dp) n _i in power 
of \f5. The term of order \/~5 vanishes due to the condition (ifj\Hj\\f)) = 0, 
and for the term of order 8 we get: (Dp)[ t /s] —5^0 Ld(p t )5, with Linbladian 

L d {p t ) := -z[H s , Pt ] + J2po(i)(C iPt c} - \{c\Ci,(H}), 

i 

where we defined the operators Cj's acting on % s by Cj := — z ^^^ , or 
equivalently 

using the decomposition of Hi on pointer states. Remark that ^2iPo(i)Ci = 
thanks to the assumed condition (ip\H a \^) = 0. Similarly, expanding 

the term (Ap) n using 7r n _i(i) ~ Po(i)[l + STr[(Cj + C])p t ] H ], we get 

lim s ^ (Ap) [t/s] = YtjVjiPtjdXtd), with 

Vjipt) := C jPt + p t C] - p t Ti[{C 3 + C))p t }. 



Note that computing these limits only uses the decomposition (16) of the 
hamiltonian H and not the existence of a preferred pointer state basis 10 
Gathering shows that the Brownian limit of the difference equation (15) is 

dp t = L d (p t ) dt + J2 v j(pt) dX t {j) (18) 
3 

where the -Xt(j)'s are the Gaussian centred processes, with quadratic varia- 
tion 

dX t (i)dX t (j) = {po{i)5i-j - Po{i)po(j)) dt, 



10 The existence of the pointer state basis was however used in determining the statistical 
properties of the fields Xt(i). 
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defined in eq.([8]) and in the discussion around this equation. This is an exam- 
ple of the diffusive Belavkin equation \14\ [T8] . It is important to recall that 
YliPo(i)Ci = since without this condition, but with X t (i) = as we do 



have, eq.(18) would not be positivity preserving [18J. Contact with previous 
sections can be made explicit by recalling that the state probability distribu- 
tion is Qt(a) = (a\pt\a) and by noticing that Tr[(Cj + Cj)pt] = (T(i)) t . We 
only took a naive limit of the difference equation ( |15[ ), to mathematically 
prove the diffusive Belavkin equation for the system density matrix in the 
scaling limit would require more delicate arguments. 

3.2.2 Poissonian jumpy limit 

The Poissonian limit occurs when {i\ip) = 0. This cannot happen for all 
i as {\i}} forms an orthonormal basis of Tip and is non zero. So, we 
assume, for simplicity, that one element of this basis is say \i*) = \ip), 
and all others are orthogonal to i-e. = for all i ^ i* . Then, 

po(i*\a) ~<5 1 an d po(i\a) ^s^o 56(i\a), for i ^ i* , with 

9(i\a) = \(i\H a \iP)\ 2 . 

This is the situation we encountered in the previous section on the classical 
Poisson jumpy limit, so that we can borrow all results obtained there. 

As in the diffusive case, it is a simple matter to naively take the continu- 



ous time limit of the difference equation (17). This only uses the decompo- 
sition the hamiltonian H but the limit exists only if (ip\Hj\ip) = 0, and we 
assume this to be true. Expanding the first term (Dp) n _i to second order 
in Vo, we get {Dp)u/§\ —s^-o L p (pt)5, with Linbladian 

L p (p t ) := -%[H, )Pt ] + ^2(D iPt Dl - i{£>jA,Pt}), 

where we defined the operators A := —i (i\Hj\ip) acting on H s , that is 

A := -i ^2 \ a )(i\H a \i>)(a\. 



a 



To compute the limit of the second term (Ap) n , we notice that, to leading 
order in 6, (i\U\if>) p(if>\U\i) ~ 5 D iP d\ and 7r n _i(i) ~ 5TT[D iPt D\} for i ^ i*, 
and we get lim s ^o(Ap)[t/S] = Ylifr* v i{Pt) dY t (i), with 

Vi(pt) ■■=——T--pt, 

Tr[Dip t Dl\ 

where the last term — p t comes from using dY t (i*) = — Yli^i* dY t (i) when 
computing the contribution of the i*-term in (Ap) n , as in previous section. 
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Gathering shows that the Poissonian limit of the difference equation (15) is 



dp t = L p (p t ) dt + ^ v i{Pt) dY t (i 



(19) 



where the lt(j)'s are the Poisson-like compensated martingales defined in 



eq.(ll) above. That is, 



dN t (i) = dY t {i) + Tr[D iPt Dl] dt, i^i*, 

where dN t (iys are the point processes with intensities (6(i))tdt defined in 
previous section, see e.g. eq. fll2|13p . Note that, using the decomposition of 
the interaction hamiltonian Hj on the pointer state basis, Hj = ^2 a \a){a\ (g> 



H a , we have 



Tr[APiA'] 



E 



8(i\a)Q t (a) = (0(i)) t 



so that the previous equation indeed coincides with eq.(ll). Equation (19) 



is an example of a jumpy Belavkin equation [14]. Let us finally point out 
that the stochastic processes are not the most general one because 

we assumed that they preserve the pointer state basis P] so that the operator 
H s , Ci or Dj are diagonal in the pointer basis. This is of course related to 



the non-demolition property. Eqs.( 18|l9 ) are also peculiar examples of more 
general class of models for continuous quantum measurements whose long 
time behavior leads to purification of mixed states, see e.g. [22], [18] . 
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